Gray Codes and Paths on the zi-Ctibe 

By E. N. GILBERT 

(Manuscript received May 27, 1957) 

Certain problems in coding and in switching theory require a list of 
distinct binary n-luplcs such that each differs from the one preceding it in 
just one coordinate. Geometrically, such a list corresponds to a path which 
follows edges of an n-dimcnsional cube. This paper finds all types of closed 
paths on cubes with n Sj 4. For larger n, a process given here will produce 
large numbers of paths. 

I. INTRODUCTION 

A Gray code is a means of quantizing an angle and representing it in 
a binary alphabet. The encoding is such that angles in adjacent quantum 
intervals are encoded into n-tuples of binary digits which differ in just 
one place. For example, taking n = 3, as the angle increases from 0° to 
360°, the binary code for the angle might go through the succession 
000, 001, 011, 010, 110, 111, 101, 100 and back to 000. Gray codes are 
used when the encoding is performed by a code wheel. At angles close 
to the boundary between two quantum intervals, any of the digits 
which change at the boundary are likely to be in error. In a Gray code 
there is only one such questionable digit, and a mistake in this digit only 
gives to the angle the code for the adjacent quantum interval. 

Although the Gray code example given for n = 3 is easily generalized 
to obtain the well known Gray (reflected binary) code for any n, there 
are, in general, a large number of other codes which also change one 
digit at a time. Our problem is to find these other codes. In special 
applications, some of the others may be preferable to the conventional 
Gray code. For instance, it may be desirable to use other numbers of 
quantum intervals beside powers of 2; powers of 10 might be a natural 
choice. If the quantity being encoded is a length rather than an angle, 
one can drop the requirement that the first and last quantum interval 
have codes differing in just one position. Then there is a still larger 
variety of encodings from which to choose. 

For n ^ 4, we will exhibit all possible codes which recycle (i.e., are 
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suitable for angles). As n grows beyond 4, the number of possibilities 
soon becomes enormous ; we will give a procedure by which a large num- 
ber of codes may be constructed. The purpose of this paper is to con- 
struct, classify and catalogue codes, but no attempt will be made to 
single out those codes which are particularly useful. 

In searching for codes the following geometrical picture is helpful. 
The set of edges of a unit n-dimensional cube forms a linear graph which 
we will call the n-cabe graph Q n . Q n has 2" vertices, each labeled by an 
n-tuple (.Tj , • • • , x„), Xi = or 1. Two vertices are joined by a line of 
Q n if their coordinates agree in all but one of the n places. We are inter- 
ested in the paths and cj'cles of Q n . By a path of length L is meant a 
set of lines (7oT r i , ViV* , • • • , T* L _iT r L ) of Q„ where the vertices F , 
• • • , V L are distinct. Cycle of length L is similarly defined but with 
Vi , • ■ ■ , V L distinct and Vo = V L . If we follow a path or cycle of Q n 
and interpret the /^-tuples which label the vertices a.s codes for succes- 
sive quantum intervals, we obtain an encoding in which only one digit 
changes from interval to interval. Thus, our problem is to find all paths 
and cycles of Q n . The cycles are the encodings suitable for angles. 

A permutation of digits in the w-tuples of a code produces a new code 
which is not significantly different from the original one. Its code wheel 
is obtainable from the original wheel merely by permuting tracks. Simi- 
larly, a complementation (interchange of and 1 in certain coordinate 
positions of the n-tuples) is a minor change. These operations correspond 
to rotation and reflection symmetries of the cube. The symmetry group 
of Q n is the hyperoctahedral group 0„ of order 2"n\. The typical sym- 
metry operation of 0„ consists of one of the n\ possible permutations of 
the coordinates of the n-tuple, followed by one of the 2" possible comple- 
mentations. Two paths or cycles will be called equivalent or of the same 
type if one can be changed into the other by applying to Q„ one of the 
symmetry operations in 0„. Although there are tremendous numbers of 
paths and cycles, it suffices to give just one of each type. Even our list 
of distinct types of cycles becomes rather long at n = 4. The exact 
number of types of cycles is not known for larger values of n. We give 
below a procedure for constructing large numbers of paths and cycles. 
If L = 2" — 1, we can specify in advance the numbers Ni , • • • , N n , 
where N k is the number of times x k changes as one follows the path from 
Vo to V h . A similar specification is possible for cycles of length 2". 
The cycles of this length contain every vertex and thus are the Hamil- 
ton lines of Q„ (Ref. 2, Ch. 2). Even the number of types of Hamilton 
lines is found to grow rapidly with n; there are nine types of Hamilton 
lines for n = 4. 
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Another application of paths on Q n may be found in switching theory. 
Here the labels on the vertices represent the states of a relay network 
with n relays. Coordinate x t is or 1 according to whether the ith. relay 
magnet is turned off or on. Since it is physically impossible to change the 
state of two relay magnets precisely simultaneously, the state of the 
entire network can change only by following lines of Q„. If the network 
is a counting circuit it is made so that its state follows a path or a cycle 
according to whether the network is intended to lock or recycle at the 
end of the count (Ref. 1, Ch. 11). 

II. -COORDINATE SEQUENCES 

A path is specified completely by listing the L + 1 vertices V , 
■ ■ • , V L in order. For example, on the 3-cube, a list might be 

000, 001, 011, 010, 110, 100, 101, 111. 

Ignoring the starting vertex, a more compact notation is to list in order 
only the coordinate places in which the change occurs. In the example 
cited, one would obtain (3231232). This L-tuple of coordinate places 
will be called the coordinate sequence for the path. 

The vertices of a path might equally well have been written down in 
the reverse order V L , • • • , V . Hence the list of coordinate places writ- 
ten in reverse order [i.e., (2321323)] will be regarded as another notation 
for the same coordinate sequence. 

Each coordinate sequence represents not only the given path but also 
every other path obtainable from it by one of the 2" complementations. 
All of these paths are of the same type; it suffices here to study coor- 
dinate sequences rather than the paths themselves. 

Two paths, P and P', are of the same type if and only if one of the 
n\ permutations of coordinates changes the coordinate sequence of P 
into the coordinate sequence of P' . Our problem thus becomes one of 
classifying coordinate sequences into symmetry types with respect to 
the symmetric group on the n coordinates. 

In a similar way, a cycle (say 001, 011, 111, 101, 001) may be repre- 
sented by a coordinate sequence (2121). Now, however, in addition to 
the two orders in which the coordinates may be written, there are L 
different vertices at which the list of changing coordinates can begin. 
Thus, for any cycle there may be as many as 2L distinct L-tuples of 
digits, all of which are considered to be trie same coordinate sequence. 

Not every list of digits is the coordinate sequence of a path or a cycle. 
A typical list, such as (121323131), represents a way of wandering along 
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lines of the cube graph, possibly visiting some vertices more than once. 
In the example cited, the vertices visited after the first and the sixth 
steps are the same. In the intermediate steps (213231) all digits k = 1, 
2, 3 appear an even number of times; for all k the net change in Xk dur- 
ing these steps is even, i.e., zero. This observation leads simply to the 
following result. 

Theorem I. An L-tuple A = (a\ , a% , • • • , a L ) a; = I, • • • , n is the 
coordinate sequence of a path of Q n if and only if every one of the L{L -f- l)/2 
blocks of consecutive digits (a* , cti+i , • • • , ay) contains at least one of the 
n digits an odd number of times. A is the coordinate sequence of a cycle of 
Q n if and only if every one of the blocks of length 1, • • • , or L — 1 contains 
some digit an odd number of times while A itself contains every digit an 
even number of times. 

As an illustration of Theorem I, we construct a simple cycle of length 
2 n (Hamilton line) on Q n . For k = 1, 2, • • • , 2" define the /cth digit of 
the 2 "-tuple to be the number 

ak = Max (d such that 2 ~~ divides k). 

That one obtains a cycle thereby is easily proved by induction on n. 
When n = 2, the construction yields (1212), the coordinate sequence of 
a cycle on the square. For larger n, the construction yields a 2 "-tuple of 
the form 

A n = (B n ,n,B n , n), (1) 

where B„ is the (2" _1 — l)-tuple obtained from A„-i by removing the 
n — 1 in the 2"~ place. Consider any block C of q < 2" consecutive 
digits of A n . If C does not contain one of the two n's, it is a block of 
the path B„. If C contains only one n, n appears an odd number of 
times. If C contains both n's, the digits of A„ not in C form a block of 
B n and contain some digit k an odd number of times; since k appears 
an even number of times in A n it appears an odd number of times in C. 
In any case, some digit appears an odd number of times in C. The Hamil- 
ton line in question corresponds to the conventional Gray code. 

III. THE 4-CUBE 

Since cycles are of greatest interest, we have constructed a list of all 
types of cycles on Qi. This list also includes all the cycles on Qs and Q 3 , 
since such cubes are contained as subgraphs (faces) of Q4 . There are 
no cycles of odd length, since every coordinate must change an even 
number of times. The numbers of types of cycles of lengths 4, 6, • • • , 16 
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are 1, 2, 7, 10, 23, 20 and 9. Of these, the cycle of length 4 is found on 
Q 2 ; both cycles of length 6 and one cycle of length 8 are found on Q z . 

The computation consisted of the enumeration of a large number of 
cases using whatever ad hoc simplifications could be found. For exam- 
ple, consider any cycle of length L which contains no pair of points which 
differ in all four coordinates. Complementing all four coordinate places 
changes this cycle into a new disjoint one. Then 2L ^ 16. It follows that 
every cycle of length 10 contains such a pair of "diametrically opposite" 
points. The cycle can be cut into two paths joining these points, one of 
length 4, the other of length 6. Hence, coordinate sequences for types of 

cycles of length 10 can all be written in the form (1234 ), and 

only the ways of rilling the six empty places must be enumerated. 

It is also helpful to draw a picture of the 4-cube in such a way that 
certain equivalences between cycles becomes geometrically obvious. 
The diagram described by Keister, Ritchie, and Washburn (Ref. 1, Ap- 
pendix to Chap. 8) is convenient. Their cube has the appearance of a 
piece of graph paper; it is agreed that any two points which can be con- 
nected by a path made up of horizontal and vertical line segments of 
length 4 represent the same point on the cube. The graph paper itself 
has symmetries, each of which is also a symmetry of the cube. Then any 
two cycles which, when drawn on the graph paper cube, can be trans- 
formed into one another by a sequence of translation, reflection or rota- 
tion symmetries of the graph paper must be of the same type. Unfor- 
tunately, these symmetries account for only 128 of the 384 symmetries 
of the 4-cube. Cycles which are equivalent with respect to the 4-cube 
group Oi may fall into as many as three distinct types with respect to 
the subgroup of graph paper symmetries. This phenomenon is illustrated 
in Fig. 1, which shows three cycles which are of the same type in spite 
of the differences in their graph paper pictures. 






Fig. 1. — Graph paper representation of cycles on Q 4 
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Table I — Coordinate Sequences of Cycles of 
Distinct Type on Q\ 



Length 2 

Length 4 

Length G 

Length 8 



Length 10 



Length 12 



Length 14 



2) 
3 2 



3 4 
3 4 
3 4 



3) 
3) 
2 1 



3) 

4) 
3) 

4) 
4) 
2) 
2 1 
1 3 



1 4 



3 1 

3 1 



1 3 
4 1 



4) 
4) 
3) 

4) 
3) 
3) 
4) 
4) 
3) 
4) 
3 4 
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Table I — Concluded 



(12134231234213) 
(12 13 12 14 12 13 2 4) 
(1213121412312 4) 
(1213121421312 4) 
(1213124121312 4) 
(12 13 12 4 12 13 2 14) 
(12132124121314) 
Length 1G (12 13 12 14 12 13 12 14) 
(121312142123212 4) 
(1213212412 13212 4) 
(121321242321323 4) 
(12132124 3 231232 4) 
(1232123432123214) 
(123212341232123 4) 
(1232123413121314) 
(121341424321234 3) 



A complete list of coordinate sequences for distinct types of cycles on 
Q t is given in Table I. 



IV. COMPOSITION 



The example (1) suggests the following result. 

Theorem II. Let the L-tuple A = (fli , • • • , a,) and the M-tuple B = 
(Jh , • • ■ , on) be coordinate sequences of two paths in Q n -\ ■ Then the (L 
+ M + l)-twple {A, 71, B) = (ay , • ■ ■ , a L , n, hi , • • ■ , b u ) is the coor- 
dinate sequence of a path in Q„ . If each of the digits 1, ■ • • , n — 1 appears 
an even number of times among d\ , • • • , a h , bi , • • • , b M , then (A, n, B, 
n) is the coordinate sequence of a cycle in Q n . 

A formal proof is easily given along the lines of the example. However, 
we prefer to note only that the theorem is obvious geometrically. We 
may envision Q„ as composed of two (n — 1) -cubes R and S. An extra 
coordinate x„ = is added to the labels on the points of R and, simi- 
larly, x„ = 1 for S. Then 2" _1 lines are added joining corresponding 
points of R and S in order to construct Q n . The path obtained by fol- 
lowing the A path in R, then stepping over into 8 and following the B 
path is represented by (A, n, B). If all digits appear an even number of 
times in ai , • • • , a h , h , • • • ,b M then the two-end points of (A, n, B) 
differ only in their x n coordinate; an additional step back to R returns 
one to the starting point and (.4 , n, B, n) represents a cycle. 

The ^dimensional path and cycle (if it exists) so constructed will be 
called the composite path and composite cycle of the paths with coordinate 
sequences A and B. Of the nine Hamilton lines listed, only the last is 
not composite. 
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A large family U n of paths and cycles on the re-cube can now be con- 
structed inductively. For n = 2, the family consists of all paths and 
cycles. When U n -i has been constructed, U n will contain all members 
of Un-i plus all paths and cycles obtainable by making a composite 
from a pair of paths of U n -\ • It will be convenient to admit single 
vertices as paths (having the null coordinate sequence). Then such 
coordinate sequences as (n, A) with .4 in ( r „_i , or even (n), represent 
paths in U n ■ Next, we complete the construction of U„ by adding all 
paths and cycles equivalent to the ones just constructed. Paths and 
cycles belonging to U„ will be called ullracomposite. 

The Hamilton line given as an example following Theorem I is ultra- 
composite, as may be seen using (1) and induction. The first five of the 
nine Hamilton lines in Table I are ultracomposite. 

V. CHANGE NUMBERS 

Let A be a coordinate sequence and let JV* , h = 1, • • •, n, be the 
number of appearances of the digit h in A. N k will be called the Ath 
change number of .4. The change numbers of any other sequence A' of 
the same type as A are just a rearrangment of Ni , • • •, N„ . Hence a 
comparison of change numbers often suffices to prove two coordinate 
sequences to be of different type. There are, however, many examples 
of coordinate sequences of different type but having the same set of 
change numbers (the list of Hamilton lines contains two with change 
numbers 6, 6, 2, 2 and five with change numbers 6, -i, 4, 2). 

If .4 and B are coordinate sequences of paths on the (n — l)-cube 
and have change numbers N t , • ■ • , N„-i and il/ x , • • • , M „_i , then the 
composite (A, n, B) has the change numbers 

Ni + Mi, ••■,JV„_ 1 + i¥ n _ 1 , 1. 

This observation suggests the possibility of an arithmetic test to decide 
whether a given set of numbers Ni , • • ■ , N n are the change numbers of 
some ultracomposite. 

In what follows we call an ?i-tuple (iVi , ■ • • , N„) a word if Ni , ■ ■ •, N n 
arc the change numbers of an ultracomposite path. We ignore ultra- 
composite cycles because they may be found merely by adding single 
steps to those ultracomposite paths which have words in which all but 
one change number is even. 

Following the inductive definition of U n given above, an inductive 
scheme for computing all words may be given: 

(i) (0,0), (0,1), (1,1) and (1,2) are words. 
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(ii) If (.V, , • • •, N n ) and (.V,', • • •, N n ') are words; so are (N x , ■■-, 
N„ , 0) and (iVi + AY, • • -, N n + AV, 1). 

(iii) Any permutation of the change numbers of a word produces an- 
other word. 

Thus, the only words with n = 3 are (000), (001), (Oil), (012), (111), 
(112), (113), (122), (123), (124), (133), and their permutations. The 
following theorem gives an arithmetic property of words. 

Theorem III. Lei the change numbers of a word be written in numerical 
order, ai ^ a 2 ^ ■ • • ^ «» • Then for all k — 1, • • • n, 

Z a t ^ 2" - 2*"\ (2) 



Proof: (Induction on n) 

When n = 2, all words satisfy (2) (see (i) above). Suppose (2) is 
true for words of lengths ^ // — 1 and consider one of length n. 
If ai = 0, the ultracomposite path described by the given word lies 
entirely on a lower-dimensional face of Q„ • Then (a 2 , ■ ■ • , a„) is a word 
of length n - 1 and thus satisfies (2). If Oi = 1, then the given word 
describes a composite path made from two paths B and C. For i = 2, 
• • • , n, a, is a sum B { + C t of certain change numbers of B and C. Then 
for k = 2, • • •, n, 

Z«- = t,B< + t,d ^ 2 bt+ Z «, (3) 

)=/,• i=k i—k i=k— 1 j=A— 1 

where bi , • • ■ , 6„_i are the numbers B 2 , • • • , B„ arranged in numerical 
order and c, are similarly defined. Since the bi and c,- satisfy (2), the 
sums on the right of (3) total ^2" - 2*"\ Finally, when k = 1, (2) is 
necessary because there are only 2" vertices in Q„. 

Not every n-tuple satisfying (2) is a word. An example is (0002). It 
is immediately recognized as not a word because every word either must 
contain some l's or contain only 0's. Even ruling out such obvious 
examples, one still finds others, such as (114). In the case of paths of 
length 2" — 1 the following stronger result is obtained. 

Theorem IV. A set of necessary and sufficient conditions that an n-lwple 
composed of numbers a,- , ai ^ • • ■ ^ a» shall be the word for an ultra- 
composite path of length 2 n — 1 is that {2) holds for k = 2, • • • , n; that 
ai = 1 and, in addition, that 

Ea, = 2"-1. (4) 
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Proof: 

Given an ultracomposite path, (2) is necessary for k = 2, • • • , n by 
Theorem III. Also cti = 1, since any composite has some coordinate 
changing only once. Equation (4) is the requirement that the length of 
the path be 2" — 1. 

Conversely, suppose the given a, satisfy the stated arithmetic condi- 
tions. If n = 2, the only possibility is a x = 1, a 2 = 2 and the coordinate 
sequence in question is (212). If n ^ 3, we are able to use a lemma of 
C. E. Shannon (Ref. 4, p. 84). Given a set of numbers bi £ b 2 ^ ■ • • ^ b n 
containing a pair b r , b s with 1 < b r < b, , a flow operation is defined 
which replaces b r by b r + 1 and b s by b a — 1. Shannon shows that those 
sets of numbers which arc obtainable by repeated flow operations start- 
ing from the initial set 1, 2, • • • , 2"~ are exactly those sets Oi ^ ■ • ■ ^ a„ 
which satisfy our (2), (4) and «i = 1. The lemma in question states 
that if 1, a 2 , • • •, a„ is such a flow pattern then for k = 2, • • •, n 

Oh = Bk -\- Ck , 

where B 2 , • • • , B n are a set of numbers (not necessarily in increasing 
order) obtainable from 1, 2, • • •, 2"~" by means of flows, and similarly 
for C 2 , • • • , C n • By induction, (B 2 , • • • , B„) and (C 2 , • ■ ■ , C„) are them- 
selves words for some paths B and C. Then (B, 1, C) is a path having 
the word (cr-i , • • • , a n ) and the theorem is proved. 

The decomposition of a k into a sum B k + C k can be done either by 
inspection or using the procedure by which Shannon's lemma is proved. 
Thus the problem of finding a path for a given word reduces to two such 
problems in lower-dimensional cubes. Continuing, one finally requires 
only 2 n ~ 2 paths on certain square faces of Q„ . Since the decomposition 
of the a/, can generally be done in several different ways, there may be 
many types of paths which can be found for a given word. 

VI. HIGHER DIMENSIONS 

For the two applications mentioned earlier, one might suppose that 
a complete list of all types of paths, or only of Hamilton lines, would 
be useful. Such tables must lengthen rapidly as n increases. For example, 
the number H„ of types of ultracomposite Hamilton lines is at least as 
large as the number of words, satisfying Theorem IV, for which a 2 , 
■ ■ -, a„ are all even. An enumeration of such words shows Hs ^ 19. 
A rapidly increasing lower bound on H„ is now given. 

Theorem V. Let s , s x , • • • be defined by the recurrence 



S n +1 = (ft + 1)S„ - (o) S »-2 ( 5 ) 
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and the initial values s = 1, ft = 1, ft = 2. Then II n +2 ^ s„ . For large 
n, an asymptotic formula is 

s n « n!(mr)- (1/ V /4 . (G) 

Proof: 

By Theorem IV the (n + l)-tuple W = (2 1 , 2 2 , • • •, 2", 1) is a word 
for some coordinate sequence T. Let a permutation P, say k — > p(fc), be 
applied just to the first n coordinate. Then T changes to PT with the 
word (2 P(1) , • • •, 2" ( "\ 1). The coordinate sequence S(P) = (T, n + 2, 
PT, n + 2) describes an ultracomposite Hamilton line in which the kth 
coordinate change number is 

' v *< J ; \2 if /.' = n + 1 or n + 2. 

Among the ■/?.! Hamilton lines S(P) there are at least as many distinct 
types as there are distinct sets of coordinate change numbers. It follows 
from the uniqueness of binary notation that 2" + 2 b = 2 C + 2 is equiva- 
lent to the statement that the (unordered) pairs (a, b) and (c, d) are 
the same. If one of the coordinate change numbers 2 K + 2 P(K) of S(P) 
equals a coordinate change number of some other fine S(P') then P' 
[say i -> p'(i)] satisfies either p'(K) = p(K) or p'[p(iC)] = A'. Then, 
in order for P' to have the same set of coordinate change numbers as 
P, every cycle of P' must either be a cycle of P or the inverse of a cycle 
of P. To get a lower bound on H n+i , we may count the number s„ of 
equivalence classes of permutations in which P and P' are considered 
equivalent when every cycle of P' is either a cycle of P or the inverse 
of a cycle of P. Precisely this enumeration is also required for Cayley's 
problem of counting the number of terms in the expansion of a sym- 
metric determinant. In this connection, (5) and (6) were given by I. 
Schur. Derivations may be found in Polya and Szego's book (Ref. 3, 
Vol. 2, Ch. 7, probs. 45 and 4G). 

In higher-dimensional cubes, it seems likely that the majority of the 
types of Hamilton fines will not be ultracomposite or even composite. 
To support this guess, we now construct a large class of non-composite 
lines. The last of the nine types listed for n = 4 was one such type. 
Using its coordinate sequence, (1213 • • • 3), we construct sequences of 
the form 

A = Q.,A lt 2,A lt 1,A 3 , ■..,3,A 16 ), 

where each of A t , ■ ■ -, A 16 represents a path of length 2"" 4 — 1 and 
contains only the digits 5, • • •, n. We also require that 5, • • •, n each 
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appear an even number of times. Each such A is the coordinate sequence 
of a non-composite Hamilton line. 
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